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METHOD AND SYSTEM FOR MAINTAINING FORWARD AND BACKWARD 
COMPATIBILITY IN FLATTENED OBJECT STREAMS 

5 BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates to an improved data 
processing system and, in particular, to a method and 
10 apparatus for object-oriented input/output, data storage, 
or communication. Still more particularly, the present 
invention provides a method and apparatus for version 
management of serialized object streams. 



15 2. Description of Related Art 

While executing" an object-oriented application, 
objects are created and then destroyed or otherwise 
freed; in a Java™ runtime environment, objects are 
eventually freed through garbage collection. In certain 
20 applications, rather than saving runtime data in 
M 1 persistent storage and then recreating objects, it is 

helpful to persist actual objects outside of the runtime 
environment. The object is then stored in persistent 
storage or communicated to another process or device. 
25 Using object serialization, an object can be 

persisted and then reused. Object serialization is the 
process of saving an object's state to a sequence of 
bytes as well as the process of rebuilding those bytes 
into a live object, although the latter is sometimes 
30 called "deserialization". The complete state of an 

object can be written to an output stream, and at some 
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later time, the object can be recreated by reading its 
serialized state from an input stream. Using a 
three-dimensional analogy for an object, a serialized 
object is often called a "flattened" object, while a 
5 recreated object is sometimes called an "inflated" 
obj ect . 

Java™ provides default mechanisms for persisting 
objects. As long as an object implements the 
"Serializable" interface or has a class within its 
10 hierarchy that implements the interface, an object can be 
serialized; a Java™ class can specify which attributes 
are not to be serialized/deserialized using the special 
Java™ keyword "transient" . The "Serializable" interface 
does not define any methods but merely acts as a marker 
15 that indicates whether serialization is allowed on a 

given object. An object can by serialized by passing the 
object to the "writeObject ( ) " method of the 
"Obj ectOutputStream" class, and the object can be 

e a j 

FU recreated by calling the "readObj ect ( ) " method of the 

m 

n 20 "ObjectlnputStream" class. This serialization behavior 

can be customized to a certain extent by implementing the 
"writeObject () " and "readObj ect ( ) " methods while still 
being able to rely on the default output and input 
mechanisms for writing and reading a flattened object. 

25 Instead of relying on the default mechanism, one can 

create a custom serialization mechanism by implementing 
the "Externalizable" interface, which defines only two 
methods , "writeExternal ( ) " and "readExternal ( ) " . By 
overriding these methods, one has complete control over 

30 the writing and reading of an object's state. When a 
class implements the "Externalizable" interface, the 
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"ObjectOutputStream" and u Obj ectlnputStream" classes use 
that class's "writeExternal ( ) " and "readExternal ( ) " 
methods to write and read the object's state to and from 
a flattened object stream. 
5 One potential problem with serializing an object is 

that a flattened object might persist within a file for a 
long period of time. Meanwhile, classes may change over 
time such that multiple versions of a class are used 
while the flattened object remains in persistent storage. 
10 When an attempt is made to read the flattened object, an 
error could occur because the version of the class that 
is being used to read the flattened object is not 
compatible with the version of the class that was being 
used when the object was flattened. 



^ 15 The Java™ environment contains some administrative 

€! support for detecting class versioning problems. When an 

01 

object is serialized, information about an object's class 
s is also serialized so that the correct class can be 

m 

m loaded when the object is deserialized. Generally, this 

2 20 information comprises the fully-qualified name of the 
M= class; in addition, all persistent-capable classes are 

given a version number. The version number of a class is 
stored in a field named "serialVersionUID" , and the 
version number is used to determine whether an object can 
25 be properly deserialized; if there is a version mismatch, 
the "java. io. InvalidClassException" will be thrown. 

If one needs to control class versioning, one can 
explicitly declare the "serialVersionUID" constant and 
ensure that it remains the same between compatible 
30 versions of a class; it should be assumed that one would 
not maintain the same "serialVersionUID" constant across 
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changes to a class that break serialization 
compatibility. If a class does not define a 
"serialVersionUID" constant, then the 
"ObjectOutputStream" class automatically generates a 
5 unique version number for the class by applying a hashing 
algorithm to information related to the class, such as 
the class name, its interfaces, fields, and methods. 
Changes that break serialization compatibility can then 
be detected through the "serialVersionUID" constant. For 
10 example, if a non-private class method signature is 
changed in a new version of the class, then the 
n automatically-computed unique version number would also 

~ change, and an exception would be thrown when an attempt 

60 

1 was made to read a flattened object stream into an 
!E 15 incompatible version of the object. 

y3 Although the administrative support for detecting 

class versioning problems is helpful to some extent, it 

2 does not provide any mechanism for recovery from this 
jTj incompatibility in class versions. 

2 20 Therefore, it would be advantageous to have a 

D 

N* methodology for facilitating object serialization and 

deserialization such that forward and backward 
compatibility can be maintained for flattened objects 
across different versions of a class. 

25 
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SUMMARY OF THE INVENTION 

A method, a system, an apparatus, and a computer 
program product are presented for object-oriented 
5 management of serializable objects. Within 

object-oriented programming languages, classes define a 
set of attributes and methods, which provide the state 
and behavior for instances of that class. For the 
purposes of this invention, the class would also define a 
10 class version number attribute that would serve to 

identify an object as an instance of a particular class 

~ version. In addition, each attribute in the set of 

□ 

C ! attributes is associated with an attribute version 

m 

number. The attribute version number represents the 

± 15 class version in which the associated attribute was first 

Q 

y3 declared within the class. When an object is written to 

y ' a data stream, all of the information contained within 

3 

□ that object would be written. The information written to 

m 

pj the stream would consist of a value identifying the 

5 20 object's class, the value of the class version attribute, 

□ 

U, a count of all of the non- transient attributes defined 

within the object, and every attribute along with its . 
associated version number. 

At some subsequent point in time, the data stream 

25 representing a serialized object may be read. An object 
will be instantiated in accordance with the class 
identifier from the data stream, at which point the class 
version number of the instantiated object and the class 
version number of the serialized object may differ. 

30 Serialized attribute values from the data stream are 

stored within the instantiated object in accordance with 
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the class version number and attribute version number 
relationships between the serialized object and the 
instantiated object. 
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BRIEF DESCRIPTION OP THE DRAWINGS 

The novel features believed characteristic of the 
invention are set forth in the appended claims. The 
invention itself, further objectives, and advantages 
thereof, will be best understood by reference to the 
following detailed description when read in conjunction 
with the accompanying drawings, wherein: 

Figure 1A depicts a typical distributed data 
processing system in which the present invention may be 
implemented; 

Figure IB depicts a typical computer architecture 
that may be used within a data processing system in which 
the present invention may be implemented; 

Figure 2A is a diagram showing a simple Person class 
that might be found in the prior art; 

Figures 2B-2C are diagrams depicting the operation 
of a standard serialization mechanism on two versions of 
the Person class in which the same version of the class 
is used for both serialization and deserialization; 

Figures 3A-3B are diagrams depicting the operation 
of a standard serialization mechanism on two versions of 
the Person class in which a different version of the 
class is used for serialization and deserialization on 
the same object stream; 

Figures 4A-4D are diagrams depicting the operation 
of a custom serialization mechanism for versions of the 
exemplary Person class in accordance with the present 
invention; 
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Figure 5 depicts a simple object model with a set of 
classes for illustrating an object-oriented methodology 
that supports serialization and deserialization with 
version management in accordance with a preferred 
embodiment of the present invention; 

Figure 6A is a flowchart depicting a serialization 
process in accordance with a preferred embodiment of the 
present invention; 

Figure 6B is a flowchart depicting a deserialization 
process in accordance with a preferred embodiment of the 
present invention; and 

Figures 7A-7D are diagrams depicting evolving 
versions of the Person class in conjunction with 
serialized data streams and an attribute value list that 
may be used as a data structure to assist with 
interpreting the implicit ordering of a data stream in 
accordance with a preferred embodiment of the present 
invention . 
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DETAILED DESCRIPTION OF THE INVENTION 



The present invention is directed to a system and a 
methodology for facilitating object serialization and 
deserialization such that forward and backward 
compatibility can be maintained for flattened objects 
across different versions of a class. These flattened 
objects may be persisted in one or more files that are 
dispersed throughout a network, or the flattened objects 
may be communicated between devices within a network. As 
background, a typical organization of hardware and 
software components within a distributed data processing 
system is described prior to describing the present 
invention in more detail. 

With reference now to the figures, Figure 1A depicts 
a typical network of data processing systems, each of 
which may contain and/or operate the present invention. 
Distributed data processing system 100 contains network 
101, which is a medium that may be used to provide 
communications links between various devices and computers 
connected together within distributed data processing 
system 100. Network 101 may include permanent 
connections, such as wire or fiber optic cables, or 
temporary connections made through telephone or wireless 
communications. In the depicted example, server 102 and 
server 103 are connected to network 101 along with storage 
unit 104. In addition, clients 105-107 also are connected 
to network 101. Clients 105-107 and servers 102-103 may 
be represented by a variety of computing devices, such as 
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mainframes, personal computers, personal digital 
assistants (PDAs), etc. Distributed data processing 
system 100 may include additional servers, clients, 
routers, other devices, and peer-to-peer architectures 
5 that are not shown. 

In the depicted example, distributed data processing 
system 100 may include the Internet with network 101 
representing a worldwide collection of networks and 
gateways that use various protocols to communicate with 
10 one another, such as Lightweight Directory Access Protocol 
(LDAP) , Transport Control Protocol/Internet Protocol 
(TCP/IP) , Hypertext Transport Protocol (HTTP) , Wireless 
Application Protocol (WAP), etc. Of course, distributed 
data processing system 100 may also include a number of 
ij 15 different types of networks, such as, for example, an 
intranet, a local area network (LAN) , or a wide area 
network (WAN) . For example, server 102 directly supports 
JsJ client 109 and network 110, which incorporates wireless 

rU communication links. Network-enabled phone 111 connects 

03 

Q 20 to network 110 through wireless link 112, and PDA 113 
^ connects to network 110 through wireless link 114. Phone 

111 and PDA 113 can also directly transfer data between 
themselves across wireless link 115 using an appropriate 
technology, such as Bluetooth™ wireless technology, to 
25 create so-called personal area networks (PAN) or personal 
ad-hoc networks. In a similar manner, PDA 113 can 
transfer data to PDA 107 via wireless communication link 
116. 

The present invention could be implemented on a 
30 variety of hardware platforms; Figure 1A is intended as an 
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example of a heterogeneous computing environment and not 
as an architectural limitation for the present invention. 

With reference now to Figure IB, a diagram depicts a 
typical computer architecture of a data processing system, 
such as those shown in Figure 1A, in which the present 
invention may be implemented. Data processing system 120 
contains one or more central processing units (CPUs) 122 
connected to internal system bus 123, which interconnects 
random access memory (RAM) 124, read-only memory 126, and 
input/output adapter 128, which supports various I/O 
devices, such as printer 130, disk units 132, or other 
devices not shown, such as a audio output system, etc. 
System bus 123 also connects communication adapter 134 
that provides access to communication link 136. User 
interface adapter 148 connects various user devices, such 
as keyboard 14 0 and mouse 142, or other devices not 
shown, such as a touch screen, stylus, microphone, etc. 
Display adapter 144 connects system bus 123 to display 
device 146. 

Those of ordinary skill in the art will appreciate 
that the hardware in Figure IB may vary depending on the 
system implementation. For example, the system may have 
one or more processors, including a digital signal 
processor (DSP) and other types of special purpose 
processors, and one or more types of volatile and 
non-volatile memory. Other peripheral devices may be 
used in addition to or in place of the hardware depicted 
in Figure IB. The depicted examples are not meant to 
imply architectural limitations with respect to the 
present invention . 
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In addition to being able to be implemented on a 
variety of hardware platforms, the present invention may 
be implemented in a variety of software environments. A 
typical operating system may be used to control program 
execution within each data processing system. For 
example, one device may run a Unix™ operating system, 
while another device contains a simple Java™ runtime 
environment. A representative computer platform may 
include a browser, which is a well known software 
application for accessing hypertext documents in a variety 
of formats, such as graphic files, word processing files, 
Extensible Markup Language (XML) , Hypertext Markup 
Language (HTML) , Handheld Device Markup Language (HDML) , 
Wireless Markup Language (WML) , and various other formats 
and types of files. 

The present invention may be implemented on a 
variety of hardware and software platforms, as described 
above. More specifically, though, the present invention 
is directed to a system and a methodology for facilitating 
object serialization and deserialization such that 
forward and backward compatibility can be maintained for 
flattened objects across different versions of a class. 

For the purposes of this invention, Java™ is used as 
the programming language to describe the problem and 
present the preferred embodiment. However, this 
invention is not restricted to the Java™ programming 
language and could be implemented in any environment that 
serializes and deserializes object instances. 

In general, a serialization compatibility problem is 
created when an attribute is added to a class. The 
present invention solves certain serialization 
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compatibility issues by associating a version number or 

identifier to each attribute. During serialization and 

deserialization, version numbers of each attribute are 

compared. For forward compatibility, i.e., when a newer 

5 class attempts to read older class data, the older class 

data is read in a manner that is determined by its 

version number while new attributes within the newer 

version are provided default values. For backward 

compatibility, i.e., when an older class attempts to read 

10 newer class data, newer attributes can be ignored. It 

should be noted that the examples discuss version 

_ numbers, but one of ordinary skill in the art would 

□ 

y3 understand that various type of identifiers, including 

m 

numbers, could also be used. The methodology is 
=1= 15 hereinafter described in more detail with respect to the 

0 . . 

remaining figures. 

With reference now to Figure 2A, a diagram shows a 
□ simple Person class that might be found in the prior art. 

= : Figure 2A contains a representation of two versions of 

5 20 the Person class. For the following examples, these two 

Q 

y, versions of the Person class will be used to demonstrate 

typical serialization and deserialization versus the 
manner in which the present invention performs 
serialization and deserialization. 

25 A preliminary Version 1 of the Person class 202 has 

a preliminary set of attributes 204. A subsequent 
Version 2 of the Person class 206 has a modified set of 
attributes 208 that includes an additional attribute that 
has been added to the class definition, "middleName" 

30 attribute 210. In Figure 2A, the attributes are shown 
without values. 
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With reference now to Figures 2B-2C, a set of 
diagrams show the operation of a standard serialization 
mechanism on two versions of the Person class in which 
the same version of the class is used for both 
5 serialization and deserialization. In Figures 2B-2C, the 
attributes are associated with values, which represent 
the actual value given to the attributes of the class 
instance (or object) . 

Figure 2B shows object 210, which is structured in 
10 accordance with Version 1 of the Person class. Object 
210 undergoes serialization operation 212 to create 
□ flattened object data stream 214. Data stream 214 then 

undergoes deserialization operation 216 to inflate object 
218, which can be seen to be a copy of the original 



Cfl 



Q 15 object 210 



Data stream 214 is an abstraction of the actual 

* destination of the serialized object and is not a 

O 

01 byte-wise copy of an actual serialized object. The 

flattened object data stream can be formatted in many 



ru 



Q 20 ways that may be dependent upon a variety of factors with 
respect to the runtime environments, the reading/writing 
applications, etc., that use the object data stream. 
Data stream 214 represents a file, a network connection 
through which the object is transmitted to another 
25 computer, or possibly a memory location where the object 
is held. 

Figure 2C shows object 220, which is structured in 
accordance with Version 2 of the Person class. Object 
220 undergoes serialization operation 222 to create 
30 flattened object data stream 224. Data stream 224 then 
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undergoes deserialization operation 226 to inflate object 
228, which can be seen to be a copy of the original 
object 220. 

The simplest and most frequent scenario for 
5 serializing an object occurs when the version of the 

class that is read out of the stream is the same as the 
version that was written. This scenario, as it would 
appear using the standard serialization mechanism, is 
represented in Figures 2B-2C. As the figures show, when 
10 the object is serialized, information that identifies the 
object's class and the value of the object's attributes 
O are placed into the object stream. When the object is 

j|j deserialized, the class identifier is read, and an object 

*Q is instantiated in accordance with the identified class, 

g 15 The object is then initialized with the attribute values 
as they are read out of the stream. 

With reference now to Figures 3A-3B, a set of 
diagrams show the operation of a standard serialization 

Ty mechanism on two versions of the Person class in which a 

ffl 

p 20 different version of the class is used for serialization 
^ and deserialization on the same object stream. While the 

standard serialization mechanism is very simple and 
straightforward, it has an inherent problem when 
confronted with operating on different versions of a 
25 class. Figures 3A-3B display two scenarios in which 
default serialization operations result in an error. 

Figure 3A shows an instance of a newer version of a 
class that is written to the data stream and a back-level 
version that is available when the information is read 
30 out of the stream. This type of scenario might occur 
when an application on the receiving end of a 
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serialization stream has not been updated with a modified 
version of a class. Object 302 is structured in 
accordance with Version 2 of the Person class and 
undergoes serialization operation 304 to create flattened 
5 object data stream 306. However, when data stream 306 
then undergoes deserialization operation 308 in an 
attempt to instantiate object 310, which is structured in 
accordance with Version 1 of the Person class, an error 
would be detected, and the deserialization operation 
10 would not be completed. 

Figure 3B shows a similar situation in which an 
™ older version of the class is written to the data stream, 

pj and when the data is read out of the data stream, an 

attempt is made to instantiate an object in accordance 
O 15 with a more recent version of the object's class. This 

. r-i 

S type of scenario might occur when an application on the 

writing or transmitting end of a serialization stream has 
not been updated with the most recent version of a class, 
yet the receiving application has been updated. Object 
O 20 322 is structured in accordance with Version 1 of the 

Person class and undergoes serialization operation 324 to 
create flattened object data stream 326. However, when 
data stream 326 then undergoes deserialization operation 
328 in an attempt to instantiate object 330, which is 
25 structured in accordance with Version 2 of the Person 

class, an error would be detected and the deserialization 
operation would not be completed. 

In both scenarios in Figure 3A and Figure 3B, the 
serialization fails when the object data is read out of 
30 the data stream because the information contained within 
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the data stream is structured differently than expected. 
While these scenarios do not occur in the usual operation 
of an application, they may occur at any time after 
updates are made to objects that are being serialized. 
5 Distributed applications, i.e., those applications in 
which a part of the application runs on one computer and 
another part runs elsewhere, are particularly susceptible 
to this serialization problem since most of their 
communication may comprise object streams. To remedy the 
10 situation, costly and time-consuming updates are required 
to the application on every system in order to avoid the 

_ serialization errors using the standard serialization 

O 

gg mechanism. 

rn * * * 

^ With reference now to Figures 4A-4D, a set of 

£ 15 diagrams show the operation of a custom serialization 

y3 mechanism for versions of the exemplary Person class in 

accordance with the present invention. The custom 



m 



D serialization mechanism of the present invention corrects 

m 

ry the versioning problems described above with respect to 

20 Figures 3A-3B by associating additional information with 
fc* attributes in the stream. Figures 4A-4D displays the 

same scenarios as above except that the serialization 
mechanism of the present invention is used to flatten and 
inflate the objects. In addition to each class receiving 
25 a version number, as is performed in the prior art, each 
attribute is also given a version number to identify the 
class version in which the attribute first appeared. 

The values preceding the attributes in the data 
stream within parentheses are the version numbers. 
30 Again, the examples merely represent data streams and are 
not byte-wise copies of actual data streams. 
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In the present invention, when an object is written 
to a flattened object data stream, attribute version 
numbers are also placed in the data stream in addition to 
the class version number. When the object is later read 
5 from the data stream, the attribute version number is 

compared to the current class version number of the newly 
created object and not to the class version number from 
the data stream. Only those attributes that are usable 
within the current class version are used for the new 
10 object. If the object that is instantiated when the 
class identifier is read from the stream is of a newer 

« class version than the class version of the object that 

U 

y3 ' was serialized into the stream, then any new attributes 

jj=i are initialized to a default value. 

4f' 15 Figure 4A shows object 402, which is structured in 

□ 

y3 accordance with Version 1 of the Person class. Object 

w ' 402 undergoes serialization operation 404 to create 



flattened object data stream 406. At some later point in 
flj time, data stream 406 then undergoes deserialization 

20 operation 408 to inflate newly instantiated object 410, 
which can be seen to be a copy of the original object 
402 . 

Figure 4B shows object 432, which is structured in 
accordance with Version 2 of the Person class. Object 
25 432 undergoes serialization operation 434 to create 

flattened object data stream 436. At some later point in 
time, data stream 436 then undergoes deserialization 
operation 438 to inflate newly instantiated object 440, 
which can be seen to be a copy of the original object 
30 432. 
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Figure 4C shows object 452, which is structured in 
accordance with Version 2 of the Person class, which 
includes the "middleName" attribute that was introduced 
in Version 2 of the Person class but did not exist in 
5 Version 1 of the Person class. Object 452 undergoes 
serialization operation 454 to create flattened object 
data stream 456. At some later point in time, data 
stream 456 then undergoes deserialization operation 458 
in an attempt to read the attribute values into newly 
10 instantiated object 460. However, the receiving/reading 
application has instantiated object 460 as an object of 
Version 1 of the Person class, whereas data stream 456 
was created by flattening an object of Version 2 of the 
^2 Person class. When the "middleName" attribute is 

□ 15 encountered, its Version 2 tag is detected to be greater 

m 



GO 



Q 

m 
ru 



than the version of the object into which it is being 
read, and the attribute value is ignored with respect to 
object 460. 

Although in the case of Figure 4C some information 



O 20 is lost, the receiving/reading application was not 

expecting the information, so from the perspective of 
that application, no information is lost. In addition, 
no error occurs upon detecting a version mismatch. 

Figure 4D shows object 472, which is structured in 

25 accordance with Version 1 of the Person class, which does 
not include the "middleName" attribute because it was 
introduced in Version 2 of the Person class and did not 
exist in Version 1 of the Person class. Object 472 
undergoes serialization operation 474 to create flattened 

30 object data stream 476. At some later point in time, 
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data stream 476 then undergoes deserialization operation 
478 in an attempt to read the attribute values into newly 
instantiated object 480. However, the receiving/reading 
application has instantiated object 480 as an object of 
5 Version 2 of the Person class, whereas data stream 476 
was created by flattening an object of Version 1 of the 
Person class. The "middleName" attribute of object 480 
is associated with a Version 2 tag, which is detected to 
be greater than the version of the object from which the 
10 data stream is being read, and the attribute value is 
given a default value. The deserialization object may 
Q then proceed to read attribute value from the data stream 

for the other attributes of object 480 without further 



y=? interference. 

q 15 Although in the case of Figure 4D some information 

— is not received that might be received in other 

y > 

s scenarios, the receiving/reading application should be 

□ 

ff: able to proceed because all attributes have a valid 

^ value. This situation should be tolerable because the 

80 

P 20 attribute that received the default value did not exist 
^ in the previous version of the class; therefore, the 

attribute value should not be regarded as critical. In 
other words, if the first version of the class was 
acceptable, then subsequent versions of the class should 
25 be tolerable, and the application developer should keep 
such potential restrictions in mind when modifying class 
definitions. The present invention solves a problem that 
exists within the prior art because no error occurs upon 
detecting a version mismatch. If the application 
30 developer or user desires to be notified when a version 
mismatch is detected, then an error message of some form 
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can be generated or logged. In addition, a dialog box 
could be presented if the application has a graphical 
user interface, and the user of the application might be 
given an option of proceeding or halting when a version 
5 mismatch is detected. 

With reference now to Figure 5, a simple object 
model depicts a set of classes for illustrating an 
object-oriented methodology for supporting serialization 
and deserialization with version management in accordance 
10 with a preferred embodiment of the present invention. In 
the preferred embodiment, each class is responsible for 
serializing and deserializing its own attributes. While 
S most of this logic can be placed in a common superclass, 

the implementation classes must provide the logic 
g 15 pertaining to the specifics of their own attributes. 

Figure 5 depicts a simple object model. Customer 
class 502 is a subclass of Person class 504, which is a 
™ subclass of BaseObject class 506. Each class defines a 

fU versionID attribute that contains the version identifier 

a 

p 20 for the particular class. Each class then defines its 
^ instance attributes, i.e., the attributes that pertain to 

the business function of the class. Each instance 
attribute is paired with another attribute that 
represents the attribute's version information. The 
25 naming convention for the version attributes that has 

been used within this example is to append "VersionID" to 
the paired instance attribute's name. For example, the 
BaseObject class defines an instance attribute named 
"databaselD" ; following the naming convention, the 
30 version attribute that is associated with attribute 
"databaselD" is named "databaseldVersionID" . 
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In the preferred embodiment of the present 
invention, each Java™ class implements the 
u writeObj ect ( ) " and "readObj ect ( ) " methods to perform the 
object serialization and deserialization. 

Within these methods, each class is responsible for 
writing and reading their attributes to and from a 
flattened object data stream. For example, from the 
class representation in Figure 5, the Customer class 
writes and reads the contents of the "customerNumber" 
attribute to and from the data stream but does take any 
actions to serialize or deserialize the "name" and the 
"databaselD" attributes in the other classes. The 
serialization of the "name" and "databaselD" attributes 
is left to the classes in which those are defined, namely 
Person and BaseObject, respectively. 

In accordance with standard Java™ serialization 
methodology, when an object is serialized, the 
"writeObject ( ) " method of the object's class would be 
invoked, which would receive the stream instance as an 
input argument so that the object's contents can be 
placed in the stream. However, with the present 
invention, when the u writeObj ect ( ) " method is called, as 
a first action, the method invokes the "writeObj ect ( ) " 
method of its superclass, which ensures that the 
superclass' attributes are placed in the stream; all of 
the attributes of the superclass (or superclasses) are 
placed into the stream until the base class is reached. 

Next, the methodology described above with respect 
to Figures 4A-4D is performed so that the object's 
attribute information is written to the stream. Every 
instance attribute written to the stream is an Object. 
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No Java™ primitive data types are written to the stream 
so that every attribute in the stream can be handled 
generically. For example, assume that an instance of the 
Customer class is serialized to the flattened object data 

5 stream. After invoking the superclass "writeObject ( ) " 
method, which in this case would be the "writeObj ect ( ) " 
method for the Person class, which in turn would invoke 
the "writeObject ( ) " method of the BaseObject class, the 
"writeObject ( ) " method defined by the Customer class 

10 would: (1) write the value of the Customer class' 

"versionID" object; (2) write the attribute count of the 
Customer class, which in this case is equal to one; (3) 



J3 write the value of "customerNumberVersionID" ; and (4) 

m 



G3 

^ write the value of the "customerNumber" attribute 



m 
ru 



± 15 When an object is deserialized, the "readObj ect ( ) " 

S 

y3 method would be invoked with a stream instance containing 

" ! a flattened object data stream for the serialized object. 

Similar to the operations that occur during 
serialization, with the present invention, when the 

W 20 "readObject () " method is called, as a first action, the 

£3 

'y, method invokes the "readObject ( ) " method of its 

superclass, which insures that the superclass' attributes 
are retrieved from the stream. 

Next, the methodology described above with respect 

25 to Figures 4A-4D for deserializing an object is performed 
so that the object's attribute information is read from 
the stream. Continuing with the example of using the 
Customer class, after invoking the "readOb j ect ( ) " method 
on the superclass, which in this case would be the 

30 "readObject ( ) " method for the Person class, the Customer 
class' "versionID" attribute and the attribute count 
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would be read from the data stream. Based on the 
attribute count, each version attribute and instance 
attribute are read from the stream. Because the order in 
which the attributes have been written is known, i.e., 
there is an implicit ordering to manner in which the 
attributed values are placed into and read from the data 
stream, the values that are read from the stream can be 
mapped to the attributes defined by the class. 

With reference now to Figure 6A, a flowchart depicts 
a serialization process in accordance with a preferred 
embodiment of the present invention. At some point in 
time, a u writeOb j ect ( ) " method is invoked to serialize an 
object into a particular flattened object data stream, as 
described with respect to Figure 5 above; it may be 
assumed that the "writeObj ect ( ) " method for the object's 
superclass has already been invoked, which would have 
invoked the "writeObject ( ) " method of its superclass and 
so on until the BaseObject class has been reached such 
that all of the attributes of the superclasses are placed 
into the stream. The serialization process begins by 
writing the class identifier into the data stream (step 
602) , followed by writing the class version number (step 
604) and the number of attributes for the class into the 
data stream (step 606) . A loop is then entered in which 
each attribute of the class is processed. The next 
unwritten attribute of the class is determined (step 
608). The attribute's version number is written to the 
data stream (step 610), and then the attribute's value is 
written to the data stream (step 612) . A determination 
is then made as to whether or not any unwritten 
attributes remain to be processed (step 614) . If so, 
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then the process branches back to step 608 to process 
another unwritten attribute. Otherwise, if there are no 
unwritten attributes to be processed, then the 
serialization process is complete. 
5 With reference now to Figure 6B, a flowchart depicts 

a deserialization process in accordance with a preferred 
embodiment of the present invention. At some point in 
time, a "readObj ect ( ) " method is invoked to deserialize 
an object from a particular flattened object data stream, 
10 as described with respect to Figure 5 above. The 
deserialization process begins by reading the class 
identifier from the data stream (step 622), after which 

»_j 

~ an instance of the class is created using the class 

ifl identifier (step 624) . The version number of the class 

£j 15 of the flattened object is then read from the data stream 
(step 626), which is used for subsequent comparisons. 

The number of attributes for the object is then read 
from the data stream (step 628) , which is then used as a 

W control variable to loop through the remaining attributes 

fn 

p 20 in the data stream for the current object (steps 
^ 630-638) . It should be noted that the attribute count 

was not depicted in Figures 4A-4D, and other equivalent 
indicators could be inserted into the data stream to 
control the demarcation of the attributes for the current 
25 object within the data stream. 

For each attribute, the attribute version number is 
read (step 630), followed by the attribute value for the 
attribute (step 632) . A determination is then made as to 
whether the version number of the attribute is less than 
30 or equal to the version number of the class for the newly 
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instantiated object (step 634) . If the version number of 
the attribute is indeed less than or equal to the version 
number of the newly instantiated object, then the 
attribute value that has been read from the data stream 
is added to the list of attribute values to be assigned 
to the attributes within the newly instantiated object 
(step 636) . If the version number of the attribute is 
greater than the version number of the class for the 
newly instantiated object, then the attribute may be 
ignored, and the loop continues. In either case, a 
determination is then made as to whether or not there are 
more attributes in the serialized data stream to be 
processed (step 638) . If so, then the process branches 
back to step 630; otherwise, the process continues. 

After the process has looped through all of the 
attributes in the data stream, then a determination is 
made as to whether or not the version number of the 
flattened object that was read from the data stream is 
less than the class for the newly instantiated object 
(step 640) . If not, then the process is complete; if so, 
then the process sets all of the newer attributes to a 
default attribute value (step 642) , which may require a 
controlled loop through a set of attributes, after which 
the process is complete. 

As shown above, the present invention uses 
customized serialization and deserialization methods to 
control the input and output of attribute values to and 
from a serialized data stream. As mentioned briefly 
above, these customized methods use an implicit ordering 
to identify the attribute values in a preferred 
embodiment. Alternatively, the serialization and 
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deserialization methods may input and output identifiers 
with the attribute values so that an implicit ordering is 
not required. By adding attribute identifiers into the 
stream, one increases the robustness and opportunity for 
5 checking for errors within the data stream, at the 

expense of increasing the size of the data stream. This 
is a trade-off that should be considered within the 
constraints of the system. 

To ensure that an implicit ordering mechanism 
10 operates correctly, the serialization method and the 
deserialization method, e.g., "writeObject ( ) " and 
"readObj ect ( ) " , would read and write attribute values in 

y3 the same order. 

£3 

tf i In order to ensure that the implicit ordering 

^ 15 mechanism operates in a straightforward manner, new 

O 

attributes that are added to subsequent versions of a 



UJ 



class could be represented within a serialized data 



□ stream by appending those values at the end of a data 

stream. In addition, when a software developer desires 

ES 20 to discard an attribute from a class, the attribute is 

O 

not entirely removed from all operations with respect to 
the data stream. Instead, an empty placeholder continues 
to be written and read from the data stream to ensure 
compatibility between differing versions of the 
25 serialization and deserialization methods that are using 
an implicit ordering mechanism. In this way, the data 
attributes are not truly removed from the class, but 
rather, those attributes cease to be utilized within the 
application . 

30 For example, in the description of Figures 4C-4D 

above, Version 1 of the Person class differed from 
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Version 2 of the Person class with the inclusion of a 
"middleName" attribute in Version 2, and this attribute 
was inserted into the data stream between other Version 1 
attributes. If the present invention were to be 
5 implemented through the use of an implicit ordering 

mechanism, the "middleName" attribute would be appended 
at the end of the data stream. The implicit ordering 
mechanism is shown in more detail below with respect to 
the remaining figures. 
10 With reference now to Figures 7A-7D, a set of 

diagrams shows evolving versions of the Person class in 
conjunction with serialized data streams and an attribute 

*S value list that may be used as a data structure to assist 

\y 

q with interpreting the implicit ordering of a data stream 

2 15 in accordance with a preferred embodiment of the present 
invention. Referring to Figure 7A, object 702 is 
structured in accordance with Version 1 of the Person 
0 class in a manner similar to object 202 shown in Figure 

m 

Rj 2A. Object 702 may represent a Version 1 Person object 

ffi 

q 20 either prior to being flattened or after being inflated, 
M 1 while serialized data stream 704 represents the object 

after it has been flattened. 

While either writing or reading object 702 from data 
stream 704, the attributes for object 702 may be stored 
25 within attribute value list 706, which may be implemented 
as a multidimensional array. Attribute value list 706 
contains attribute values 708 and their associated 
version numbers 710. Empty attributes 712 and unused 
version numbers 714 may or may not be used within an 
30 attribute value list. Version numbers 710 may be 
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statically defined within the serialization and 
deserialization methods. 

Referring to Figure 7B, object 722 is structured in 
accordance with Version 2 of the Person class in a manner 
5 similar to object 206 shown in Figure 2A. Object 722 may 
represent a Version 2 Person object either prior to being 
flattened or after being inflated, while serialized data 
stream 724 represents the object after it has been 
flattened. 

10 While either writing or reading object 722 from data 

stream 724, the attributes for object 722 may be stored 
within attribute value list 726. Attribute value list 
726 contains attribute values 728 and their associated 
version numbers 730. Version 2 of the Person class has 
O 15 added the "middleName" attribute, which is appended to 

the attribute value list. When changes are made to class 
to create a different version of the class, the software 
Iji developer modifies the code for the serialization and 

deserialization methods in a manner such that the 
p 20 attribute values are stored and retrieved from the proper 



□ 



locations within the attribute value list. 

Referring to Figure 7C, object 742 is structured in 

accordance with Version 3 of the Person class. Object 

742 may represent a Version 3 Person object either prior 
25 to being flattened or after being inflated, while 

serialized data stream 744 represents the object after it 

has been flattened. 

While either writing or read object 742 from data 

stream 744, the attributes for object 742 may be stored 
30 within attribute value list 746. Attribute value list 
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746 contains attribute values 748 and their associated 
version numbers 750. Version 3 of the Person class has 
added the "bldg" attribute to Version 2 of the Person 
class for specifying a building number, which is also 
5 appended to the attribute value list. As should be 

apparent, when new attributes are added to a class, they 
are stored within the data structure for the attribute 
value list such that an implicit ordering is maintained 
across versions of the class. 
10 Referring to Figure 7D, object 762 is structured in 

accordance with Version 4 of the Person class. Object 
p 762 may represent a Version 4 Person object either prior 

S to being flattened or after being inflated, while 

C 5 serialized data stream 764 represents the object after it 

q 15 has been flattened. 

While either writing or read object 762 from data 



03 



s stream 764, the attributes for object 762 may be stored 

P 

IT} within attribute value list 766. Attribute value list 

2 766 contains attribute values 768 and their associated 

CO 

O 20 version numbers 770. Version 4 of the Person class no 

M« 

longer contains the "middleName" attribute of the Person 
class. As should be apparent, when attributes are 
deleted from a class, the data stream and the data 
structure for the attribute value list retain a 

25 placeholder for the removed attribute such that an 

implicit ordering is maintained across versions of the 
class. In this example, version number 768 has been set 
to zero as a flag that the attribute has been removed. 

By retaining placeholders for removed attributes, an 

30 implicit ordering is maintaining across versions of a 
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class, and the deserialization method can readily map 
attributes between different versions of objects when 
there is a mismatch between a flattened object and the 
newly instantiated object that is being inflated using 
5 the values from the flattened object's data stream. When 
necessary, attributes can be given default values. It 
should be apparent to one of ordinary skill in the art 
that temporary data structures, such as an object array, 
can be employed to implement the process described above. 
10 The advantages of the present invention should be 

apparent in view of the detailed description of the 
invention that is provided above. In the prior art, an 
y3 error would be thrown when a version mismatch was 

^ detected between the flattened and the inflated data 

=P 15 objects. In contrast, the present invention associates 

Q 

£3 version numbers or version identifiers with each 

attribute and places the attribute version numbers into a 
flattened object's data stream. When the object is later 
read from the data stream, the attribute version numbers 

53 20 can be used to determine the manner in which the 

P 

attribute values should be processed. Only those 
attribute values that are appropriate for the current 
class version are used for the new object, while missing 
attribute values can be set to default values. Hence, 

25 the present invention does not return an error when 
version mismatches are detected. 

It is important to note that while the present 
invention has been described in the context of a fully 
functioning data processing system, those of ordinary 

30 skill in the art will appreciate that some of the 
processes associated with the present invention are 
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capable of being distributed in the form of instructions 
in a computer readable medium and a variety of other 
forms, regardless of the particular type of signal 
bearing media actually used to carry out the 
distribution. Examples of computer readable media 
include media such as EPROM, ROM, tape, paper, floppy 
disc, hard disk drive, RAM, and CD-ROMs and 
transmission- type media, such as digital and analog 
communications links . 

The description of the present invention has been 
presented for purposes of illustration but is not 
intended to be exhaustive or limited to the disclosed 
embodiments. Many modifications and variations will be 
apparent to those of ordinary skill in the art. The 
embodiments were chosen to explain the principles of the 
invention and its practical applications and to enable 
others of ordinary skill in the art to understand the 
invention in order to implement various embodiments with 
various modifications as might be suited to other 
contemplated uses. 



